The current best estimate of the Galactocentric distance 
of the Sun based on comparison of different statistical 
- c s ) -. techniques 
o: 

<N ■ Zinovy Malkin M 

! ^ulkovo Observatory, St. Petersburg 196140, Russia 

00- 2 St. Petersburg State University, St. Petersburg 198504, Russia 

e-mail: malkin@gao.spb.ru 

Abstract 

Q_ ( - In this paper, the current best estimate of a fundamental Galactic parameter Galac- 

tocentric distance of the Sun Rq has been evaluated using all the available estimates 
| published during the last 20 years. Unlike some other studies, our analysis of these 

results showed no statistically significant trend in Rq during this period. However 
we revealed a statistically valuable improvement in the Rq uncertainties with time of 
about 0.2 kpc for 20 years. Several statistical techniques have been used and com- 
■ pared to obtain the most reliable common mean of 52 determinations made in 1992- 

2011 and its realistic uncertainty. The statistical methods used include unweighted 
mean, seven variants of the weighted mean, and two variants of median technique. 
\Q ■ The Ro estimates obtained with these methods range from 7.91 to 8.06 kpc with un- 

. certainties varying from 0.05 to 0.08 kpc. The final value derived in this analysis is 

O Ro = 7.98±0.15| stat ±0.20 I S y S t kpc, which can be recommended as the current best es- 

timate of the Galactocentric distance of the Sun. For most of the practical applications 
• • ■ the value of Rq = 8.0 ± 0.25 kpc can be used. 

x- 

^. 1 Introduction 

An accurate knowledge of the distance from the Sun to the center of the Galaxy, _R 0; is 
important in many fields of astronomy and space sciences such as establishing a distance 
scale in the Universe, studying the structure and dynamics of the Galaxy, determination 
of the distance to extragalactic objects, etc. In particular, the primary motivation for this 



study was a wish to improve the accuracy of modeling of the Galactic aberration (IMalkin 



2011af b 

During decades, many tens of scientifically meaningful determinations of Rq were made. 
Various methods and observational data were used to obtain these results, which sometimes 
substantially differ from each other. The uncertainties associated with this determinations 
also vary in wide range, and not always correspond to the real accuracy of reported results, 
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Table 1: Previous average estimates of Rq 



Paper 


Period covered 


R , kpc 


Kerr & Lvnden-Bell (1986) 


1974-1986 


8.5 ± 1.1 


Reid (1989) 


1974-1987 


7.7 ±0.7 


Reid (1993) 


1974-1992 


8.0 ±0.5 


Nikiforov C2004) 


1974-2003 


7.9 ±0.2 


Avedisova (2005) 


1992-2005 


7.8 ± 0.32 



which was not always properly assessed. For these reasons, it may be difficult for the 
user to choose the best Rq value suitable for his application. These shortcomings of the 
original measurements can be mitigated in an average value or a common mean (CM), 
which is a generally used procedure. Equally important is to derive a realistic uncertainty 
associated with this combined value. Both underestimated and overestimated uncertainties 
are undesirable. 

Table [1] shows the latest average estimates of Rq. All the three values are statistically 
consistent, i.e. lies inside the overlapping la intervals. However, the uncertainty of the first 
result is too large to s atisfy the need of many exacting applications. The reason may be that 



Kerr fc Lynden- Belli (119861 ) computed the uncertainty of the final result as the standard 



deviation (STD) of the individual determinations with respect to the unweighted mean 
value, and ign ored stati s tical e rror equal to 0- 2 2. On the other hand, the uncertainty of 
the results of iNikiforovl (120041 ) and lAvedisoval (120051 ) may be underestimated because it 
seems to be just a formal error coming from the least squares procedure without a check o f 
consistency of the data used in the computation. Notice that the result of lAvedisoval (120051 ) 
is less precise because , most probably , it is obtained from relati vely small set of data with 
respect to the work of lNikiforovl ( 120041 ) . The result of iReidl (119931 ) based on a comprehensive 
analysis of many data looks as the most realistic both with respect to Rq value and its 
uncertainty. However it was obtained about 20 years ago, and evidently needs revision. 
In should be noted that all these results were obtained with detailed considerations of the 
physical aspects of observational aspects, but maybe without sufficient statistical analysis 
of the published estimates. In particular, they used simple (un)weighted mean to compute 
a statistical estimate of CM without accounting for possible statistical discrepancy of the 
published data. 

A goal of this study is to derive a new best estimate of Rq. We will use all available 
determinations published during last 20 years (it will be shown below that earlier results 
do not contribute substantially to the final result). Besides we will use several statistical 
techniques which allow processing not only consistent but also discrepant data. This task 
is not uniqu e. It is well known metro logical problem actual for many fields of science and 
engineering ( ICox fc Harris! 120041 ) . In particular, in this study we conside r several methods 



developed for evaluatio n of the best estimates of some physical constants (IMacMahon et al. 



20041 ; IChen et al.ll201lf ) 



This paper is limited in scope to the statistical analysis of the Ro determinations found 
in the literature. We will not attempt to discuss the methods involved in the various deter- 
minations of Ro, either theoretical or observational. This study is primarily aimed at testing 
of a capability of different mathematical approaches in deriving the best current estimates 



2 



of the Galactic fundamental constants. 

The paper is organized as follows. Section [2] described statistical methods used to evaluate 
the best estimate of R . Section [3] presents all the available R estimates published during 
last 20 years and their preliminary analysis. The methods described in Section [2] will be 
used in Section H] to obtain the final result. The work is closed with Concluding remarks. 



2 Statistical methods used 

Let we have n measured values of a physical quantity Xi with associated reported STD 
(uncertainties) Sj, i — 1 . . . n. Our goal is to compute the best estimate of this quantity and 
its associated uncertainty. This task is known as the problem of computation of a common 
mean (CM). A solution of the problem is usually not so simple in practice. Input values 
Xi generally come from various research groups and are obtained with different methods, 
observational techniques and data sets. Also the uncertainties s i £1X6, clS cL rule, obtained not 
in uniform way, and their statistical properties are usually not known. These circumstances 
may make input data rather discrepant than consistent, and using simple statistical methods 
from textbooks on the subject like most often used classical weighted mean may be not 
justified. For these reas o n, several al t ernati v e statist i cal me thods were developed, see, e.g., 



MacMahon et al.l (120041 ): IChen et all (1201 lh : iMalkinl fl2011d ) and papers cited therein. For 



evaluation of Ro, several of these methods were selected that are most representative and 
relevant, in our opinion, to this study. They are briefly discussed in this section. 

Weighted mean (WM). This classical method is most widely used. The mean value 
is computed by 

x w = EtlPiXi , (1) 
V 

with STD 

&wi = —p , (2) 

VP 

using weights 

1 n 

Pi = ~, P = J2Pi- ( 3 ) 

S i i=l 

The same formulae (pQ) and §2§ are used in computation of CM is also used in the LRW, 
NR, and MP methods described below. However, these methods use modified procedures to 
compute the weights of the input measurements. 

Another important statistics is the reduced (normalized) \ 2 computed by 



dof n — 1 

It is close to unity for consistent measurements, and is much greater than 1 in case of 
discrepant data or underestimated reported uncertainties. 

Least squares approach leads to alternative estimate of the WM uncertainty. The solution 
of the least square problem Xi = x w + £j with weights pi gives the same WM value x w , but 
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another estimate of its uncertainty: 



< J w2 



\ p(n-l) 



(5) 



One can see that a w i, depends on s, and does not depend on the scatter of Xi, whereas 
a W 2 depends on both relative values of sf and the scatter of Xi. They are equal to each other 
in the case of consistent data, i.e. when /dof = 1. Difference between the two a estimates 
may be attributed to systematic errors in xi and/or underestimating of Sj. It's important 
to note that neither a m \ nor a w % use all the input information, namely scatter of the input 
measurements and absolute and relative values of their uncertainties. Afte r investigation 



of behavior of these estimates using simulated and real data, iMalkinl (J2001bj) proposed the 
combined estimate: 

°w3 = \l oil + 0I2 ■ (6) 

The latter appro ach can provide more stable and realistic uncertainty estimate in prac- 
tical cases, see, e.g., (IMalkinl l2001al ). More detailed discussion on these three appro aches to 



compu te the WM STD and several simulated and practical examples are given in iMalkin 
f l2011cf ). 



Another procedure to compute the WM uncertainty was proposed by lGray et al.l (119901 ). 
It uses Bayesian approach and is recommended by the authors for evaluation of experimental 
data in case of doubt about correctness of reported uncertainties. In this method, the CM 
is also computed by (pQ), but its STD is given by 



&w4 



\| p(n-3) 



(7) 



One can see that this approach is similar to the least square one, but gives a bit more 
conservative estimate of the uncertainty. 

Unweighted mean (UWM). The UWM estimate can be justified when the uncertain- 
ties of the input measurements are not given, not trusted or suspected to be no t cons istent. 
This may be also the case for Rq determinations, see e.g. iKerr fc Lynden-Belll (119861 ). Un- 
weighted CM estimate and its uncertainty is computed by 



2 u 



EI 



X .; 



n 



a 



[ Xj t x > 



A n(n — 1) 



(9) 



which corresponds to <j W 2 computed with unit weights. 

Limitation of relative weights (LRW). This procedure is adopted by the Nuclear 
Data Section of International Atomic Energy Agency (INicholsl 12004 ). It is developed to pre- 
vent a measurements with very small reported uncertainty from dominating CM. According 
to this method, no input value should have a relative weight Pi/p greater than 0.5. When 
such a value found, its uncertainty should be increased, and then a weighted mean should 
be recomputed. 
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Normalized residuals (NR). This method introduced by iJames et al.l ( 119921 ) is also 
intended to limit a relative statistical weights of input values. However, unlike LRW method, 
the uncertainties of only discrepant data are adjusted. For this purpose, the normalized 
residuals are computed by 

(10) 



/ PPi 
P-Pi 



(X; 



OCq, 



The limiting value of Ti is defined as 



r 



Vl.81nn + 2.6. 



For any measurement with |r,-| > r , the uncertainty of the i-th measurement is inflated. 
The procedure is started from the measurement with the largest |rj|, and repeated until no 
Ti will be greater than r$. 



Mandel-Paule (MP). The MP estimate of CM flPaule fc Mandellll982h is defined as 
x w computed by ([TJ with weights 

vn = -^-^ , (12) 



where sf is computed by solving the equation 



n 

£ 

i=i 



n 



1 . 



(13) 



This method treats the set of systematic errors of the input measurements as a source 
of random variability, and si is an estimate of the "inter-laboratory" variance. In fact, this 
method simply increases original uncertainties by an additive variance to make the normal- 
ized x 2 / dof equal to unity. So, its use is on ly justifi e d in a case of th e original y 2 /dof > 1. In 
this respect the MP approach is similar to iMalkinl (J2001b|, 1201 let ) . iRukhin fc Vangell (ll998f ) 
defined a modified MP procedure to be as previously with p instead of (p — 1) on the right 
side of ( fT3|) . The modified approach produced practically the same result in our case. 

Median (M). Another approach routinely used to get the estimate of CM is computation 
of a median x m . The median is known as a robust statistics less influenced by outliers. 
However its standard definition does not provide an estimate of error of a median value (it 
makes it immune to unreliable unc ertainties though). A possible approach to compute a 
median uncertainty can be found in iMiillerl (119951 . l2000al ). Let x m be the median of Xj, i.e. 
x m = med{xi}. Now we can compute the median of the absolute deviations (MAD) as 



MAD = med{\xi — x m \} 



The uncertainty of x m is then taken as 



1.8582 



MAD . 



(14) 



(15) 



One can see that this estimate of the median uncertai nty depends only on the data scatter 
and not on the input uncertainties. Later iMullerl (j2000bl ) proposed a method to take account 
of the uncertainties in input data and thus compute weighted median and its STD. However 
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its practical realization, as pointed out by the author, is more cumbersome, and the testing 
results and discussion given therein do not show clear advantage of using weighted median. 

Bootstrap median (BM). The bootstrap method is based on a Monte-Carlo procedure 
to estimate CM and its uncertainty. A random sample (with replacement) of length n is 
selected from input data, and the median x m j is computed. After repeating the sampling 
M times (j = 1 . . . M), the best estimate of CM and its STD are computed by 



* b = , (16) 



Ob 



\ M^l ' (17) 



To obtain a reliable estimates of x& and <Jb, sufficiently large number of sampling should 
be made. After testing, we found that M = 10, 000 provides stable result not changing 
substantially with increasing M. 

Note that three methods used in this work, UWM, M, and BM, do not use the uncer- 
tainties reported with data. 



3 Input Rq determinations 



In this section we strived to collect all the results of determination of Rq published during 
last 20 years. This interval was chosen because test computations with earlier data showed 
no significant difference with final result of this paper presented in Section HJ It will be shown 
below that the average result converges to a stable value during the period considered in 
this work. The reader interested in the historical data can refer to, e.g., iKerr fc Lynden-Bell 



(119861 ): iReidl ( 119931 ): ISurdml (119991 ): iNikiforovl (120041 ) . 

All the results found in the lit erature were us ed in this work with few exceptions. W e 



did no t use lGlushkova et al.l ([19981) later r evise d by Glushk ova et al. ([1999 uEisenhauer et al.. 
( 20031) revised by Eisenhauer et al. ( 2005 ). and Paczyhski &: Staneki ( 19981 ) revised by Stanek et al 
(l2000h . 



When both random (statistical) and systematic uncertainties are given, they were summed 
in quadrature. See Table [2] for details. 

If tw o different values are given for the l ower and upper boundari es of the confidence 
interval ( jGhez et al.l 120081 ; iReid et al.l l2009bt IVanhollebeke et al.l |2009J) , the mean value of 
these boundaries was used as the uncertainty of the result. When the authors give sev- 
eral estimates for R Q without final conclusion, an unweighted average of this estimates was 
computed. Details are given in Table [3j 

The first impression of the input data is a large spread in the reported uncertainties 
from 0.15 to 1.1 kpc with about 1.5 times improvement during last 20 years (Fig[T|). The 
value of this trend —0.0097 ± 0.0054 kpc/yr is statistically significant. A reason for such 
large scatter in uncertainty, except the real difference in the accuracy of the methods used, 
can be the fact that some authors may be not critical enough in estimating the error of 
the result. Besides, as mentioned above, only few authors give an estimate of systematic 
error of the reported result in addition to the usually used statistical (formal) error. All these 
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Table 2: R determinations with estimation of both statistical and systematic errors 



Paper i?o, kpc 



Nishivama et al. (2006) 


i? = 


= 7.52 


± 


0.10 


stat 


± 


0.35 


syst 


Groeneweeen et al. (2008) 


i?0 = 


= 7.94 


± 


0.37 


stat 


± 


0.26 


syst 


TriDDe et al. (2008) 


Rq = 


= 8.07 


± 


0.32 


stat 


± 


0.13 


syst 


Gillessen et al. (2009b) 


Rq = 


= 8.33 


± 


0.17 


stat 


± 


0.31 


syst 


Gillessen et al. (2009a) 


Rq = 


= 8.28 


± 


0.15 


stat 


± 


0.29 


syst 


Matsunaea et al. (2009) 


Rq = 


= 8.24 


± 


0.08 


stat 


± 


0.42 


syst 


Sato et al. (2010) 


Ro = 


= 8.3 


± 


0.46 


stat 


± 


1.0 


syst 



Table 3: 


Averaged Rq values 




Paper 


Original Rq 


estimates, kpc 


Glass et al. (1995) 


8.7 ±0.7 


8.9 ±0.7 


Lavden et al. (1996) 


7.6 ±0.4 


8.3 ±0.5 


Genzel et al. (2000) 


7.9 ±0.85 


8.2 ±0.9 


Groenewegen & Blommaert (2005) 8.6 ± 0.7 


8.8 ±0.4 


Shen k Zhu (2007) 


7.95 ± 0.62 


8.25 ±0.79 


Ghez et al. (2008) 


7.96 ± 0.63 


8.36 ±0.37 


Maiaess et al. (2009) 


7.7 ±0.7 


7.8 ±0.6 



factors can indicate discrepancy between determinations R , which requires careful statistical 
processing i nstead of a straightforward computation of a weighted mean. In particular, for 
this reason, Kerr fc Lynden-Belll ( 119861 ) proposed to consider unweighted mean value for the 
best estimate of Galactic constants, including R . 

It is important for further analysis to chec k the absenc e of the significa n t systemat i c drif t 



in i nput measurements. Ma ny authors, e.g., iReidl (119931 ) : ISurdinl (119991 ); INikiforovl (J2004) 



and iFoster fc Cooper! (|2010l) d iscuss ed the long-term trends in R estimates. IReidl ( 119931 ); 



Surdinl ( 119991 ) and iNikiforovl ( 120041 ) showed t he large nega tive slope in the 1970s to the 



beginning o f the 1990 s , As t o the last 20 years, ISurdinl (119991 ) showed practically no drift for 
1990-1998. INikiforovl (120041) showed zero or maybe small positive drift for 1990-2003, and 



Foster fc Cooperl (120101 ) revealed the large positive slope for 1992-2010. 



To clarify this contradiction we repeated the computation of the trend in R Q estimates 
during the last 20 years using the full data set considered in this paper. All the values 
listed in Table H] are depicted in Fig. [2] with the epochs computed with a fraction of the 
year based on the publication date taken from ADS or journal number. When the month 
is not specified, the year fraction 0.5 was accepted. In a few cases of coincident epochs, a 
small fraction of a year was voluntarily added or subtracted. From processing of this data 
we obtained the value —0.011 ± 0.011 kpc/yr for weighted slope and ±0.006 ± 0.010 kpc/yr 
for unweighted one. 

Comparing these data with Foster fc Cooper ( 2010l ) we can conclude that our results do 
not confirm the large trend in the R estimates observed by the authors. Our computations 
also revealed a li near trend bu t mu ch smaller and st atistically unreliable, which corresponds 
to the results of lSurdinl dl999h and INikiforovl (120041 ). 

Another simple but effective test of the significance of the long-term trend in the data 
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Table 4: Ro estimates used in this study 
Ro, kpc STD Reference 

73 08 Merrifield (19921 

8.1 1.1 Gwinn et al. (1992) 

7.6 0.6 Moran et al. (1993) 

7.6 0.4 Maciel (1993) 

8.09 0.3 Pont et al. (1994 '! 

7.5 1.0 Nikiforov k Petrovskava (1994) 

7.0 0.5 Rastorguev et al. (1994) 

8.8 0.5 Glass et al. (1995) 

7.1 0.5 Dambis et al. (1995) 

8.3 1.0 Carnev et al. Q995) 
8.21 0.98 Huterer et al. (1995) 
7.95 0.4 Lavden et al. (1996) 
7.55 0.7 Honma k Sofue (1996) 
8.1 0.4 Feast (1997) 

8.5 0.5 Fea sT k Whitelock (1997) 

7.66 0.54 Metzger et al. (1998) 
8.1 0.15 Udalski (1998 ) 

7.1 0.4 Oiling k Merrifield (1998) 

8.51 0.29 Feast et al. (1998) 

8.2 0.21 Stanek k Garnavich (1998) 

8.6 1.0 Surdin (1999) 

7.4 0.3 Glush kova et al. (1999) 

7.9 0.3 McNamara et al. (2000) 

8.67 0.4 Stanek et al. (2000) 

8.2 0.7 Nikiforov (2000) 
8.24 0.42 Alves (2000) 

8.05 0.6 Genzel et al. (2000) 

8.3 0.3 Gerasimenko (2004) 

7.7 0.15 Ba busiaux k Gilmore (2005) 

8.01 0.44 Avedisova (2005) 

7.62 0.32 Eisenhauer et al. (2005) 

8.7 0.6 Groenewegen k B lommaert (2005) 

7.2 0.3 Bica et al. (2006) 

7.52 0.36 Nishivama et al. (2006) 

8.1 0.7 Shen k Zhu (2007) 

7.4 0.3 Bobvlev et al. (2007 ) 
7.94 0.45 Groenewegen et al. (2008 ) 
8.07 0.35 Trippe et al. (2008) 

8.16 0.5 Ghez et al. (2008) 

8.33 0.35 Gil lessen et al. (2009b) 

8.7 0.5 Van hollebeke et al. (2009) 

7.58 0.40 Dambis (2009 ) 

7.2 0.3 Bo natto et al. (2009) 
8.4 0.6 Reid et al. (2009a ) 
7.75 0.5 M aiaess et al. (2009) 
7.9 0.75 Reid et al. (2009b) 
8.24 0.43 Matsunaga et al. (2009) 
8.28 0.33 Gillessen et al. (2009a ) 
7.7 0.4 Dambis (2010) 

8.1 0.6 Maiaess (2010) 

8.3 1.1 Sato et al. (2010 ) 

7.80 0.26 Ando et al. (2011) 
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Figure 1: Uncertainty of R estimates used in this study. Unit: kpc. 
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Figure 2: R estimates used in this study. The weighted (solid line) and unweighted (dashed 
line) trend is also shown. Unit: kpc. 
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Table 5: Average estimates of Ro obtained by different statistical methods. The second and 
third columns contain references to formulas used for computation of the result given in the 
last column. See Section [2] for method designations 



Method 


R , kpc 


X 2 /dof 


WM with a wl 


7.909 ±0.051 


1.227 


WM with a w2 


7.909 ± 0.056 




WM with a w3 


7.909 ± 0.076 




WM with a w4 


7.909 ± 0.058 




UWM 


7.960 ± 0.062 




LRW 


7.909 ± 0.051 


1.227 


NR 


7.909 ± 0.051 


1.227 


MP 


7.911 ±0.059 


1.000 


M 


8.060 ± 0.075 




BM 


8.028 ± 0.083 





(not necessarily linear) is the Abbe criterion. Strictly speaking, it is intended to test the null 
hypothesis that all the measurements have the same mathematical expectation. The Abbe 
statistics ca n be computed as the ratio between the Allan variance AVAR (see details of this 
statistics in iMalkinl fl2008l l2011bh ) and the variance of the measurements VAR: 

AVAR 
Q ~ ~VAR' 

AVAR = ^=i(yi-yi + i) 2 (18) 
2n-l) 



VAR 



n — 1 



,2 



where the mean value x is computed by (jSj). 

In presence of significant trend, one can expect that VAR will be substantially greater 
than AVAR. In other words, if the value of q is less than corresponding percentage point, we 
have to reject the null hypothesis, i.e. confess that the input data contain systematic shift. 
In our case, we have obtained q = 1.26, which is well greater than the 1% critical value 0.69. 

Finally, we can conclude that no statistically significant trend is found in the analyzed 
data during last 20 years. 



4 Evaluation of R 

The results Rq determination in 1992-2011 listed in Table H] were processed with 10 statistical 
statistical techniques described in Section [2] The final results obtained by all the methods 
are presented in Table El 

For MP estimate, we also have got an estimate of the inter-laboratory error s&, which 
is equal to 0.16 kpc. It shows a level of consistency between the input Ro determinations. 
Also, this value was added in quadrature to the original uncertainties according to ( TT2|) to 
derive the final MP estimate. 
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Figure 3: Convergence of R estimates. The abscissa axis shows the number of input esti- 
mates counted back from the latest one. Unit: kpc. 



It should be noted that the uncertainty of the LRW, NR, and MP estimates originally 
computed as a w \ can also be computed as a W 2, &w3 or a w ^. We made corresponding calcula- 
tions, but in our case the results are identical to the WM, and are not presented here. For 
more discrepant data these modifications may worth consideration. 

Figure [3] shows the convergence of the results obtained with different statistical methods 
with respect to the number of input data used in computation of CM. The data sets used for 
this computation always begins with the last determination and have different extents in the 
past. This test is intended to estimate how much historical data contribute to the current 
best estimate. From this test, we can conclude that only the last 35-40 determinations made 
during the last about 15 years practically define the current best estimate of R . 

There is no well-founded choice between results obtained with different statistical tech- 
niques. For this reason, it seems reasonable to accept the final R® value as the middle of 
the interval overlapping all the estimates of CM with their la uncertainties, namely [7.909- 
0.076, 8.060+0.075] = [7.833, 8.135]. A half of the span of this interval can be accepted as 
the uncertainty of the final i?o estimate. Thus, the following final value of the Galactocentric 
distance can be accepted: 

Ro = 7.98 ±0.15 kpc. (19) 



In the framework of the classical method, the value (Tl9|) considered as a confidence 
interval corresponds to the confidence probability of 0.995. However this value of uncer- 
tainty is only statistical one indeed. Various considerations on possible sources and va l- 



ues of svstematic errors in determinations of Rn can be found in Nishivama et al. 


(2006h: 


Kerr & Lvnden-Bcll ( 


L986): 


Reid (1993): Nikiforov C 


200^ 


.); Groeneweeen et al. (2008): 


Nikiforov 


(2008 


); TripDe et al. 


(2008 


): Gillessen et al. (2009; 


0; : 


Vlatsunaea et al. (2009): Sato et al. 


(2010 


). Analysis of the systematic errors of the published Rq estimates is beyond the scope 
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of this study. However, it should be mentioned th at results of test comp utations with the 
original and recomputed Rq estimates suggested by iNikiforovl ( 120041 120081 ) did not show any 
significant impact on the final value. 

Taking into account that the Rq measurements analyzed in this study were derived from 
observations of different objects and making use of several different methods with differ- 
ent systematic errors, we can presume that these errors are generally independent between 
methods, and hence are substantially mitigated in the averaged value ( 1191) . Perhaps 0.15-0.2 
kpc is a realistic estimate for remaining systematic error in the average result. On the other 
hand, a bandwag o n effe ct in the R measurements is suspected by many authors, e.g., iReid 
( 119931 ); INikiforovl (120041 ) to be substantial. It may bias the current best estimate, but there 
is no way to quantify this effect. 

Finally, the following value of the Galactocentric distance of the Sun has been obtained 
in this study: 

R = 7.98 ± 0.15 | s tat ± 0.20 | syst kpc. (20) 



which can be considered as the current best estimate of R . 
applications the value of Rq = 8.0 ± 0.25 kpc can be sufficient. 



For most of the practical 



5 Concluding remarks 

In this paper we present a new current best estimate of the distance from the Sun to the center 
of the Galaxy, Rq, derived from comparison of estimates obtained with different statistical 
techniques. Some techniques gives mostly "theoretical" estimates without accounting for 
possible discrepancy in the data used. Other were developed just to evaluate discrepant 
data and thus should give more realistic estimates. 

In spite of R estimates used in this study were obtained by different methods and object 
samples, our analysis has shown that they are rather consistent than discrepant. From the 
results given in Table |5j one can see that the same or very close values were obtained with 
WM, LRW, NP, and MP methods. This is an evidence that there are no large systematic 
errors in the estimates available in literature. However, although the estimates derived by 
several variants of the weighted mean approach give close values of Rq, their uncertainties 
differ substantially ranging from 0.51 to 0.76 kpc. Besides, the estimates derived by two 
variants of median-family technique differ substantially from the WM-family results. All 
this means that Rq determinations made during the last 20 years are not fully consistent 
yet, and using and comparison of different statistical techniques is justified to obtain a 
reliable combined Rq estimate. The same can be said about other astronomical constants 
derived from combined processing of different determinations. 



It is important tha t our R n e s timat e is quite consistent with the va l ues fo und by IReid 



( 1l998h (8.0 ± 0.5 kpc), INikiforovl too4 ) (7.9 ± 0.2 kpc), and lAvedisoval (l2005h (7.8 ± 0.32 



kpc). We can conclude from this that the Rq value is to the moment well established around 
8.0 kpc with an error of a few tenths of kpc. This fact, together with absence of a significant 
trend in the Rq determination made during last 20 years, can lead to a conclusion that the 
bandwagon effect may be not so significant as suspected. 

Finally, the fast convergence of the R determinations found in this study open a possibil- 
ity of improving R uncertainty substantially during coming years. It is interesting to notice 
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that the results listed in Tabled] also show a strong tendency to decreasing the R error with 
time. Unfortunately, it is mainly explained not by improvement of the observational results, 
but rather by inhomogeneity of the methods used to estimate the uncertainty of the average 
value, as discussed above. 
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